Systems and methods for disparate data source aggregation, self-adjusting data model and API

ABSTRACT

A disparate data source aggregation system and methods are provided which may pull or retrieve talent data or features from disparate data sources, automatically correlate the data across the different data sources, build a self-adjusting system database that captures the talent data from the disparate data sources, and lets users search, query and build model insights on the aggregated data of the system database without human intervention. A method for disparate data source aggregation may include: extracting a first feature set having a first extracted feature and a second feature set having a second extracted feature; determining, if the first extracted feature of the first feature set matches the second extracted feature of the second feature set; and aggregating the first feature set with the second feature set if the first extracted feature of the first feature set matches the second extracted feature of the second feature set.

CLAIM OF PRIORITY

This application claims priority under 35 USC § 120 to U.S. patentapplication Ser. No. 16/172,755 filed on Oct. 27, 2018, entitled“SYSTEMS AND METHODS FOR DISPARATE DATA SOURCE AGGREGATION,SELF-ADJUSTING DATA MODEL AND API”; which claims priority under 35 USC §119(e) to U.S. Provisional Patent Application Ser. No. 62/580,027, filedon Nov. 1, 2017, entitled “SYSTEMS AND METHODS FOR DISPARATE DATA SOURCEAGGREGATION, SELF-ADJUSTING DATA MODEL AND API”; the entire contents ofeach and together are hereby incorporated by reference in its entirety.

FIELD OF THE INVENTION

This patent specification relates to the field of data processing anddata management. More specifically, this patent specification relates tosystems and methods that are configured to enable data processing anddata management by aggregating data from disparate data sources.

BACKGROUND

Talent systems, such as LinkedIn, Indeed and internal talent data areused to store and maintain data describing various individuals typicallyin a job-seeking or employment history or skills context. Numeroustalent system websites exist and new ones are being launched frequently.Most people who use a talent system are members of more than one talentsystem website or database.

Talent systems have their own unique data models and can contain bothstructured and unstructured data on a plurality of individuals. Thisdata is commonly organized based on one or more identifiers that areunique to each individual. An Applicant Tracking System (ATS) is oneexample. In an ATS system, the system primarily tracks candidates andtheir related profile data, and the unique identifier could be candidateid or email address or any other attribute that identifies thecandidate. Another example is a Human Resources Information System(HRIS). A HRIS system primarily tracks employees, their internal jobhistory and performance review data. In HRIS systems, the uniqueidentifier could be employee ID number, social security number, or anyother attribute that identifies the employee. A third example is aCandidate Relationship Management (CRM) system. A CRM system managesTalent pipeline, prospects, their interests, willingness to relocate,potential job titles, etc. In CRM systems, the unique identifier couldbe a contact ID number, email address or any other attribute thatidentifies the contact.

To bring all the data together from different talent systems and createan aggregated profile normally requires an army of developers whounderstand these systems well and understand the unique identifier ineach of these systems. Developers build programs to import the data fromthese systems into staging tables. These programs use traditional SQLjoins on the unique identifiers to identify datasets that overlap acrosssystems, build temporary tables to move the data into a singleaggregated profile. Any time the data model or attributes change in thesource system, the whole development cycle starts over again to recreatethese temporary tables, capture the new data model, attributes and pullthem all together. If there is no unique identifier in the sourcesystem, the developers run into a roadblock.

Therefore, a need exists for novel systems and methods for dataprocessing and data management for talent systems. A further need existsfor novel systems and methods that are configured to enable dataprocessing and data management by aggregating data from disparate datasources. Finally, a need exists for novel systems and methods for talentdata from disparate systems, automatically correlates Talent data acrosssystems, builds a self-adjusting data model that captures data fromdisparate systems and lets users search, query and build model insightson the aggregated data sets without human intervention.

BRIEF SUMMARY OF THE INVENTION

Systems and computer implemented methods for disparate data sourceaggregation, self-adjusting data model and API may include a disparatedata source aggregation system which may pull or retrieve talent datafrom disparate data sources, automatically correlates the data acrossthe different data sources, build a self-adjusting system database thatcaptures the talent data from the disparate data sources, and lets userssearch, query and build model insights on the aggregated data of thesystem database without human intervention.

In some embodiments, a computer implemented method for disparate datasource aggregation is provided. The method may include the steps of:extracting, via a computing device processor, a first feature set havinga first extracted feature from a first data set of a first data sourceand a second feature set having a second extracted feature from a seconddata set of a second data source; determining, via a computing deviceprocessor, if the first extracted feature of the first feature setmatches the second extracted feature of the second feature set; andaggregating, via a computing device processor, the first feature setwith the second feature set if the first extracted feature of the firstfeature set matches the second extracted feature of the second featureset.

In further embodiments, a computer implemented method for disparate datasource aggregation may include the steps of: extracting feature datafrom a data set of one or more data sources; automatically correlatingand augmenting the extracted features against the disparate data sets orfeature sets for individuals without the need for a unique identifierfor each individual to compare the extracted features against thedisparate data sets; creating and aggregating profile records with thefeature data into a system database; automatically self-adjusting theaggregated profile records of the system database; providing aggregateddata from the system database to other systems; and providing aself-adjusting application program interface for searching, querying,and modeling insights of the data in the system database.

According to another embodiment consistent with the principles of theinvention, a computer implemented method for creating and aggregatingprofile records is provided. In some embodiments, the method may includethe steps of: receiving extracted feature sets from a data set of one ormore data sources; determining feature sets that substantial match oneor more feature sets from disparate data sets; creating profile recordsfrom feature set and any substantially matching feature sets; andstoring profile records in system database.

According to yet another embodiment consistent with the principles ofthe invention, a computer implemented method for managing a self-adjustaggregated system database is provided. In some embodiments, the methodmay include the steps of: obtaining one or more data sets; extractingfeature sets from the data sets; determining feature sets thatsubstantial match feature sets from a system database; and updatingfeature sets of system database with features from substantiallymatching feature sets.

According to still another embodiment consistent with the principles ofthe invention, a computer implemented method of providing aself-adjusting application program interface is provided. In someembodiments, the method may include the steps of: receiving, via acomputing device processor, a first query from a first programmableinterface; searching, via a computing device processor, a systemdatabase for data that satisfies the first query; and outputting data,via a computing device processor, via the first programmable interfacedata that satisfies the first query. For example, the method ofproviding a self-adjusting application program interface may receive aquery that may be constructed to find features of an aggregated profilethat span disparate data sources, such that the query may be used toretrieve aggregated profiles for user having a skill X in a first datasource and having a performance rating of 4 in a second data source inan aggregated profile record of the system.

In further embodiments, a computer implemented method of providing aself-adjusting application program interface the system database mayinclude data obtained by a computer implemented method for disparatedata source aggregation, the method including the steps of: extracting,via a computing device processor, a first feature set having a firstextracted feature from a first data set of a first data source and asecond feature set having a second extracted feature from a second dataset of a second data source; determining, via a computing deviceprocessor, if the first extracted feature of the first feature setmatches the second extracted feature of the second feature set; andaggregating, via a computing device processor, the first feature setwith the second feature set if the first extracted feature of the firstfeature set matches the second extracted feature of the second featureset.

BRIEF DESCRIPTION OF THE DRAWINGS

Some embodiments of the present invention are illustrated as an exampleand are not limited by the figures of the accompanying drawings, inwhich like references may indicate similar elements and in which:

FIG. 1 depicts an illustrative example of some of the components andcomputer implemented methods which may be found in a disparate datasource aggregation system according to various embodiments describedherein.

FIG. 2 illustrates a block diagram showing an example of a server whichmay be used by the system as described in various embodiments herein.

FIG. 3 shows a block diagram illustrating an example of a client devicewhich may be used by the system as described in various embodimentsherein.

FIG. 4 depicts a block diagram illustrating some applications of anaggregation module which may function as software rules enginesaccording to various embodiments described herein.

FIG. 5 illustrates a block diagram of an example of a first feature setwhich may contain a number of extracted data fields or extractedfeatures according to various embodiments described herein.

FIG. 6 shows a block diagram of an example of a second feature set whichmay contain a number of extracted data fields or extracted featuresaccording to various embodiments described herein.

FIG. 7 depicts a block diagram of an example of a third feature setwhich may contain a number of extracted data fields or extractedfeatures according to various embodiments described herein.

FIG. 8 illustrates a flow diagram depicting how engines of theaggregation module may generate an aggregated profile record accordingto various embodiments described herein.

FIG. 9 shows a block diagram of an example of an aggregated profilerecord which may contain a number of aggregated data fields oraggregated features according to various embodiments described herein.

FIG. 10 depicts a block diagram of an example of a method for disparatedata source aggregation according to various embodiments describedherein.

FIG. 11 illustrates a block diagram of an example of a method forcreating and aggregating profile records according to variousembodiments described herein.

FIG. 12 shows a block diagram of an example of a method for managing aself-adjusting aggregated system database according to variousembodiments described herein.

FIG. 13 depicts a block diagram of an example of a method of providing aself-adjusting application program interface according to variousembodiments described herein.

DETAILED DESCRIPTION OF THE INVENTION

The terminology used herein is for the purpose of describing particularembodiments only and is not intended to be limiting of the invention. Asused herein, the term “and/or” includes any and all combinations of oneor more of the associated listed items. As used herein, the singularforms “a,” “an,” and “the” are intended to include the plural forms aswell as the singular forms, unless the context clearly indicatesotherwise. It will be further understood that the terms “comprises”and/or “comprising,” when used in this specification, specify thepresence of stated features, steps, operations, elements, and/orcomponents, but do not preclude the presence or addition of one or moreother features, steps, operations, elements, components, and/or groupsthereof.

Unless otherwise defined, all terms (including technical and scientificterms) used herein have the same meaning as commonly understood by onehaving ordinary skill in the art to which this invention belongs. Itwill be further understood that terms, such as those defined in commonlyused dictionaries, should be interpreted as having a meaning that isconsistent with their meaning in the context of the relevant art and thepresent disclosure and will not be interpreted in an idealized or overlyformal sense unless expressly so defined herein.

Definitions

As used herein, the term “computer” refers to a machine, apparatus, ordevice that is capable of accepting and performing logic operations fromsoftware code. The term “application”, “software”, “software code” or“computer software” refers to any set of instructions operable to causea computer to perform an operation. Software code may be operated on bya “rules engine” or processor. Thus, the methods and systems of thepresent invention may be performed by a computer based on instructionsreceived by computer software.

The term “client device” or sometimes “electronic device” or just“device” as used herein is a type of computer generally operated by aperson or user of the system. In some embodiments, a client device is asmartphone or computer configured to receive and transmit data to aserver or other electronic device which may be operated locally or inthe cloud. Non-limiting examples of client devices include: personalcomputers (PCs), workstations, laptops, tablet PCs including the iPad,cell phones including iOS phones made by Apple Inc., Android OS phones,Microsoft OS phones, Blackberry phones, or generally any electronicdevice capable of running computer software and displaying informationto a user. Certain types of client devices which are portable and easilycarried by a person from one location to another may sometimes bereferred to as a “mobile device” or “portable device”. Some non-limitingexamples of mobile devices include: cell phones, smartphones, tabletcomputers, laptop computers, wearable computers such as Apple Watch,other smartwatches, Fitbit, other wearable fitness trackers, GoogleGlasses, and the like.

The term “computer readable medium” as used herein refers to any mediumthat participates in providing instructions to the processor forexecution. A computer readable medium may take many forms, including butnot limited to, non-volatile media, volatile media, and transmissionmedia. Non-volatile media includes, for example, optical, magneticdisks, and magneto-optical disks, such as the hard disk or the removablemedia drive. Volatile media includes dynamic memory, such as the mainmemory. Transmission media includes coaxial cables, copper wire andfiber optics, including the wires that make up the bus. Transmissionmedia may also take the form of acoustic or light waves, such as thosegenerated during radio wave and infrared data communications.

As used herein the term “data network” or “network” shall mean aninfrastructure capable of connecting two or more computers such asclient devices either using wires or wirelessly allowing them totransmit and receive data. Non-limiting examples of data networks mayinclude the internet or wireless networks or (i.e. a “wireless network”)which may include Wifi and cellular networks. For example, a network mayinclude a local area network (LAN), a wide area network (WAN) (e.g., theInternet), a mobile relay network, a metropolitan area network (MAN), anad hoc network, a telephone network (e.g., a Public Switched TelephoneNetwork (PSTN)), a cellular network, or a voice-over-IP (VoIP) network.

As used herein, the term “database” shall generally mean a digitalcollection of data or information. The present invention uses novelmethods and processes to store, link, and modify information suchdigital images and videos and user profile information. For the purposesof the present disclosure, a database may be stored on a remote serverand accessed by a client device through the internet (i.e., the databaseis in the cloud) or alternatively in some embodiments the database maybe stored on the client device or remote computer itself (i.e., localstorage). A “data store” as used herein may contain or comprise adatabase (i.e. information and data from a database may be recorded intoa medium on a data store).

In describing the invention, it will be understood that a number oftechniques and steps are disclosed. Each of these has individual benefitand each can also be used in conjunction with one or more, or in somecases all, of the other disclosed techniques. Accordingly, for the sakeof clarity, this description will refrain from repeating every possiblecombination of the individual steps in an unnecessary fashion.Nevertheless, the specification and claims should be read with theunderstanding that such combinations are entirely within the scope ofthe invention and the claims.

New systems and methods for enabling data processing and data managementby aggregating data from disparate data sources are discussed herein. Inthe following description, for purposes of explanation, numerousspecific details are set forth in order to provide a thoroughunderstanding of the present invention. It will be evident, however, toone skilled in the art that the present invention may be practicedwithout these specific details.

The present disclosure is to be considered as an exemplification of theinvention, and is not intended to limit the invention to the specificembodiments illustrated by the figures or description below.

The present invention will now be described by example and throughreferencing the appended figures representing preferred and alternativeembodiments. As perhaps best shown by FIG. 1 , an illustrative exampleof some of the physical components which may comprise a disparate datasource aggregation system (“the system”) 100 according to someembodiments is presented. The system 100 is configured to facilitate thetransfer of data and information between one or more access points 103,client devices 400, and servers 300 over a data network 105. A datastore 308 accessible by the server 300 may contain one or moredatabases. Each user device 400 may send data to and receive data fromthe data network 105 through a network connection 104 with an accesspoint 103. The data may comprise any information that one or more users101 or individuals desire to input into the system 100 includinginformation describing one or more users 101 or individuals, informationdescribing the actions of one or more users 101 or individuals,information requested by one or more users 101 or individuals,information supplied by one or more users 101 or individuals, and anyother information which a user 101 or individual may desire to input orenter into the system 100.

In this example, the system 100 comprises at least one client device 400(but preferably more than two client devices 400) configured to beoperated by one or more users 101. Client devices 400 can be mobiledevices, such as laptops, tablet computers, personal digital assistants,smart phones, and the like, that are equipped with a wireless networkinterface capable of sending data to one or more servers 300 with accessto one or more data stores 308 over a network 105 such as a wirelesslocal area network (WLAN). Additionally, client devices 400 can be fixeddevices, such as desktops, workstations, and the like, that are equippedwith a wireless or wired network interface capable of sending data toone or more servers 300 with access to one or more data stores 308 overa wireless or wired local area network 105. The present invention may beimplemented on at least one client device 400 and/or server 300programmed to perform one or more of the steps described herein. In someembodiments, more than one client device 400 and/or server 300 may beused, with each being programmed to carry out one or more steps of amethod or process described herein.

The system 100 is configured to automatically aggregate talent data fromone or more data sources 111, 112, 113. In preferred embodiments, thesystem 100 is configured to automatically aggregate talent data from twoor more disparate data sources 111, 112, 113. The talent data may beaggregated into a system database 330. Users 101 may search, query, andmodel insights into the data of the system database 330.

Data sources 111, 112, 113, may comprise talent systems, such asLinkedIn, Indeed, and internal talent data systems, such as thosemanaged by Taleo or Kenexa, which store and maintain data describingvarious individuals, typically in a job-seeking or employment history orskills context. Data stored and maintained by a data source 111, 112,113, may be stored in a data store 308 accessible by one or more servers300. Most individuals who use a data source 111, 112, 113, are membersof more than one data source 111, 112, 113. Each data source 111, 112,113, may comprise a profile for each individual that is a member of thedata source 111, 112, 113, however, some individuals may have more thanone profile on one or more data sources 111, 112, 113. Each profile foran individual may contain a number of features or data fields which maydescribe that individual. Example features may include the name of theindividual; the current employer of the individual; a previous employerof the individual; the current job title of the individual; previous jobtitle of the individual; and/or any other information which may describethe individual. Data sources 111, 112, 113, that organize their data ina unique or proprietary manner may be referred to as disparate datasources 111, 112, 113, in so much as they may have different features,different names for features, different data sorted in their features,or any other types of disparate data. The system 100 is able toaggregate or collect talent data from disparate data sources 111, 112,113, into a system database 330 and to allow users 101 to search, query,and model insights into the data of the system database 330.

Referring now to FIG. 2 , in an exemplary embodiment, a block diagramillustrates a server 300 of which one or more may be used in the system100 or standalone. The server 300 may be a digital computer that, interms of hardware architecture, generally includes a processor 302,input/output (I/O) interfaces 304, a network interface 306, a data store308, and memory 310. It should be appreciated by those of ordinary skillin the art that FIG. 2 depicts the server 300 in an oversimplifiedmanner, and a practical embodiment may include additional components andsuitably configured processing logic to support known or conventionaloperating features that are not described in detail herein. Thecomponents (302, 304, 306, 308, and 310) are communicatively coupled viaa local interface 312. The local interface 312 may be, for example butnot limited to, one or more buses or other wired or wirelessconnections, as is known in the art. The local interface 312 may haveadditional elements, which are omitted for simplicity, such ascontrollers, buffers (caches), drivers, repeaters, and receivers, amongmany others, to enable communications. Further, the local interface 312may include address, control, and/or data connections to enableappropriate communications among the aforementioned components.

The processor 302 is a hardware device for executing softwareinstructions. The processor 302 may be any custom made or commerciallyavailable processor, a central processing unit (CPU), an auxiliaryprocessor among several processors associated with the server 300, asemiconductor-based microprocessor (in the form of a microchip or chipset), or generally any device for executing software instructions. Whenthe server 300 is in operation, the processor 302 is configured toexecute software stored within the memory 310, to communicate data toand from the memory 310, and to generally control operations of theserver 300 pursuant to the software instructions. The I/O interfaces 304may be used to receive user input from and/or for providing systemoutput to one or more devices or components. User input may be providedvia, for example, a keyboard, touch pad, and/or a mouse. System outputmay be provided via a display device and a printer (not shown). I/Ointerfaces 304 may include, for example, a serial port, a parallel port,a small computer system interface (SCSI), a serial ATA (SATA), a fibrechannel, Infiniband, iSCSI, a PCI Express interface (PCI-x), an infrared(IR) interface, a radio frequency (RF) interface, and/or a universalserial bus (USB) interface.

The network interface 306 may be used to enable the server 300 tocommunicate on a network, such as the Internet, the data network 105,the enterprise, and the like, etc. The network interface 306 mayinclude, for example, an Ethernet card or adapter (e.g., 10BaseT, FastEthernet, Gigabit Ethernet, 10 GbE) or a wireless local area network(WLAN) card or adapter (e.g., 802.11a/b/g/n). The network interface 306may include address, control, and/or data connections to enableappropriate communications on the network. A data store 308 may be usedto store data.

The data store 308 may include any of volatile memory elements (e.g.,random access memory (RAM, such as DRAM, SRAM, SDRAM, and the like)),nonvolatile memory elements (e.g., ROM, hard drive, tape, CDROM, and thelike), and combinations thereof. Moreover, the data store 308 mayincorporate electronic, magnetic, optical, and/or other types of storagemedia. In one example, the data store 308 may be located internal to theserver 300 such as, for example, an internal hard drive connected to thelocal interface 312 in the server 300. Additionally in anotherembodiment, the data store 308 may be located external to the server 300such as, for example, an external hard drive connected to the I/Ointerfaces 304 (e.g., SCSI or USB connection). In a further embodiment,the data store 308 may be connected to the server 300 through a network105, such as, for example, a network attached file server. Preferably,the system 100 may comprise a system database 330 which may be stored inone or more data stores 308.

The memory 310 may include any of volatile memory elements (e.g., randomaccess memory (RAM, such as DRAM, SRAM, SDRAM, etc.)), nonvolatilememory elements (e.g., ROM, hard drive, tape, CDROM, etc.), andcombinations thereof. Moreover, the memory 310 may incorporateelectronic, magnetic, optical, and/or other types of storage media. Notethat the memory 310 may have a distributed architecture, where variouscomponents are situated remotely from one another, but can be accessedby the processor 302. The software in memory 310 may include one or moresoftware programs, each of which includes an ordered listing ofexecutable instructions for implementing logical functions. The softwarein the memory 310 may include a suitable operating system (O/S) 314 andone or more programs 320.

The operating system 314 essentially controls the execution of othercomputer programs, such as the one or more programs 320, and providesscheduling, input-output control, file and data management, memorymanagement, and communication control and related services. Theoperating system 314 may be, for example Windows NT, Windows 2000,Windows XP, Windows Vista, Windows 7, Windows 8, Windows 10, WindowsServer 2003/2008 (all available from Microsoft, Corp. of Redmond,Wash.), Solaris (available from Sun Microsystems, Inc. of Palo Alto,Calif.), LINUX (or another UNIX variant) (available from Red Hat ofRaleigh, N.C. and various other vendors), Android and variants thereof(available from Google, Inc. of Mountain View, Calif.), Apple OS X andvariants thereof (available from Apple, Inc. of Cupertino, Calif.), orthe like. The one or more programs 320 may be configured to implementthe various processes, algorithms, methods, techniques, etc. describedherein.

Referring to FIG. 3 , in an exemplary embodiment, a block diagramillustrates a client device 400 of which one or more may be used in thesystem 100 or the like. The client device 400 can be a digital devicethat, in terms of hardware architecture, generally includes a processor402, input/output (I/O) interfaces 404, a radio 406, a data store 408,and memory 410. It should be appreciated by those of ordinary skill inthe art that FIG. 3 depicts the client device 400 in an oversimplifiedmanner, and a practical embodiment may include additional components andsuitably configured processing logic to support known or conventionaloperating features that are not described in detail herein. Thecomponents (402, 404, 406, 408, and 410) are communicatively coupled viaa local interface 412. The local interface 412 can be, for example butnot limited to, one or more buses or other wired or wirelessconnections, as is known in the art. The local interface 412 can haveadditional elements, which are omitted for simplicity, such ascontrollers, buffers (caches), drivers, repeaters, and receivers, amongmany others, to enable communications. Further, the local interface 412may include address, control, and/or data connections to enableappropriate communications among the aforementioned components.

The processor 402 is a hardware device for executing softwareinstructions. The processor 402 can be any custom made or commerciallyavailable processor, a central processing unit (CPU), an auxiliaryprocessor among several processors associated with the client device400, a semiconductor-based microprocessor (in the form of a microchip orchip set), or generally any device for executing software instructions.When the client device 400 is in operation, the processor 402 isconfigured to execute software stored within the memory 410, tocommunicate data to and from the memory 410, and to generally controloperations of the client device 400 pursuant to the softwareinstructions. In an exemplary embodiment, the processor 402 may includea mobile optimized processor such as optimized for power consumption andmobile applications.

The I/O interfaces 404 can be used to receive data and user input and/orfor providing system output. User input can be provided via a pluralityof I/O interfaces 404, such as a keypad, a touch screen, a camera, amicrophone, a scroll ball, a scroll bar, buttons, bar code scanner,voice recognition, eye gesture, and the like. System output can beprovided via a display device such as a liquid crystal display (LCD),touch screen, and the like. The I/O interfaces 404 can also include, forexample, a serial port, a parallel port, a small computer systeminterface (SCSI), an infrared (IR) interface, a radio frequency (RF)interface, a universal serial bus (USB) interface, and the like. The I/Ointerfaces 404 can include a graphical user interface (GUI) that enablesa user to interact with the client device 400. Additionally, the I/Ointerfaces 404 may be used to output notifications to a user and caninclude a speaker or other sound emitting device configured to emitaudio notifications, a vibrational device configured to vibrate, shake,or produce any other series of rapid and repeated movements to producehaptic notifications, and/or a light emitting diode (LED) or other lightemitting element which may be configured to illuminate to provide avisual notification.

The radio 406 enables wireless communication to an external accessdevice or network. Any number of suitable wireless data communicationprotocols, techniques, or methodologies can be supported by the radio406, including, without limitation: RF; IrDA (infrared); Bluetooth;ZigBee (and other variants of the IEEE 802.15 protocol); IEEE 802.11(any variation); Z-Wave wireless communications protocol used primarilyfor home automation; IEEE 802.16 (WiMAX or any other variation); DirectSequence Spread Spectrum; Frequency Hopping Spread Spectrum; Long TermEvolution (LTE); cellular/wireless/cordless telecommunication protocols(e.g. 3G/4G, etc.); wireless home network communication protocols;paging network protocols; magnetic induction; satellite datacommunication protocols; wireless hospital or health care facilitynetwork protocols such as those operating in the WMTS bands; GPRS;proprietary wireless data communication protocols such as variants ofWireless USB; and any other protocols for wireless communication. Thedata store 408 may be used to store data. The data store 408 may includeany of volatile memory elements (e.g., random access memory (RAM, suchas DRAM, SRAM, SDRAM, and the like)), nonvolatile memory elements (e.g.,ROM, hard drive, tape, CDROM, and the like), and combinations thereof.Moreover, the data store 408 may incorporate electronic, magnetic,optical, and/or other types of storage media.

The memory 410 may include any of volatile memory elements (e.g., randomaccess memory (RAM, such as DRAM, SRAM, SDRAM, etc.)), nonvolatilememory elements (e.g., ROM, hard drive, etc.), and combinations thereof.Moreover, the memory 410 may incorporate electronic, magnetic, optical,and/or other types of storage media. Note that the memory 410 may have adistributed architecture, where various components are situated remotelyfrom one another, but can be accessed by the processor 402. The softwarein memory 410 can include one or more software programs, each of whichincludes an ordered listing of executable instructions for implementinglogical functions. In the example of FIG. 3 , the software in the memorysystem 410 includes a suitable operating system (O/S) 414 and programs420.

The operating system 414 essentially controls the execution of othercomputer programs, and provides scheduling, input-output control, fileand data management, memory management, and communication control andrelated services. The operating system 414 may be, for example, LINUX(or another UNIX variant), Android (available from Google), Symbian OS,Microsoft Windows CE, Microsoft Windows 7 Mobile, iOS (available fromApple, Inc.), webOS (available from Hewlett Packard), Blackberry OS(Available from Research in Motion), and the like. The programs 420 mayinclude various applications, add-ons, etc. configured to provide enduser functionality with the client device 400. For example, exemplaryprograms 420 may include, but not limited to, a web browser, socialnetworking applications, streaming media applications, games, mappingand location applications, electronic mail applications, financialapplications, and the like. In a typical example, the end user typicallyuses one or more of the programs 420 along with a network 105 toexchange information with the system 100.

In some embodiments, the system 100 may comprise an aggregation module321 which may be a program 320 run by a server 300 and/or be a program420 run by a client device 400. An aggregation module 321 may becomprise logic operations and software rules engines, such as anextraction engine 322, aggregation engine 323, database engine 324,and/or an API engine 325, which may enable the system 100 to input,output, and manipulate data of a system database 330, data of any numberof data sources 111, 112, 113, or any other data accessible by thesystem 100.

FIG. 4 depicts a block diagram illustrating some applications of anaggregation module 321 which may function as software rules enginesaccording to various embodiments described herein. In preferredembodiments, an aggregation module 321 may include an extraction engine322, an aggregation engine 323, a database engine 324, and anapplication programming interface (API) engine 325. The aggregationmodule 321 and/or one or more of the engines 322, 323, 324, 325, mayoptionally be configured to run on a server 300 and/or a client device400 which may be in communication with a system database 330 accordingto various embodiments described. It should be understood that thefunctions attributed to the engines 322, 323, 324, 325, described hereinare exemplary in nature, and that in alternative embodiments, anyfunction attributed to any engine 322, 323, 324, 325, may be performedby one or more other engines 322, 323, 324, 325, or any other suitableprocessor logic.

Generally, the system database 330 having one or more, such as a firstfeature set 120A, a second feature set 120B, and a third feature set120C, and preferably a plurality of feature sets 120, may be maintained,searched, and otherwise manipulated by the database engine 324 of theaggregation module 321. In some embodiments, the extraction engine 322may comprise or function as extraction logic stored in the memory 310,410 which may be executable by the processor 302, 402, of a server 300and/or client device 400. In further embodiments, the extraction engine322 may be configured to extract data from the data sets 115, 116, 117,of one or more data sources 111, 112, 113, and to create one or morefeature sets 120A, 120B, 120C, having one or more features for anindividual.

In some embodiments, the aggregation engine 323 may comprise or functionas aggregation logic stored in the memory 310, 410 which may beexecutable by the processor 302, 402, of a server 300 and/or clientdevice 400. In further embodiments, the aggregation engine 323 may beconfigured to determine if one or more extracted features 121-131 of afeature set 120 matches one or more extracted features 121-131 ofanother second feature set 120. If one or more extracted features of thefeature sets 120 match, the aggregation engine 323 may be configured toaggregate or combine the feature sets 120 preferably by storing theaggregated feature sets 120 in an aggregated profile record 150 in thesystem database 330.

In some embodiments, the database engine 324 may comprise or function asdatabase logic stored in the memory 310, 410 which may be executable bythe processor 302, 402, of a server 300 and/or client device 400. Infurther embodiments, a database engine 324, optionally under thedirection of an aggregation engine 323, may be configured to send,receive, access, modify, and otherwise manipulate data in the systemdatabase 330 of a data store 308.

In some embodiments, the API engine 325 may comprise or function asinterface logic stored in the memory 310, 410 which may be executable bythe processor 302, 402, of a server 300 and/or client device 400. Infurther embodiments, an API engine 325 may be configured to receive aquery from a programmable interface; search the system database 330 fordata that satisfies the query; and output, via the respectiveprogrammable interface data that satisfies the query.

FIGS. 5-7 depict a block diagrams showing examples of feature sets 120A,120B, 120C, which may contain a number of extracted data fields orextracted features 121A-131C. A feature set 120A, 120B, 120C, may becreated by an extraction engine 322 by extracting talent data from adata set 115, 116, 117, or database of a data source 111, 112, 113.Talent data may describe the data in a feature set 120 or an aggregatedprofile record 150 of the system database 330 or the data in a data set115, 116, 117, of a data source 111, 112, 113, of an individual.Generally, a data set 115, 116, 117, may be likened to a profile for anindividual with each feature or data field preferably containing talentdata which may describe that individual in the system 100. For example,an extraction engine 322 may extract or retrieve a first feature set120A from a first data set 115 of a first data source 111. The firstdata source 111 may comprise LinkedIn, the first data set 115 maycomprise one or more data fields or features for a profile attributed toJohn Smith, and the first feature set 120A may comprise the data andfields of John Smith that were retrieved from LinkedIn by the extractionengine 322. As another example, the extraction engine 322 may create afirst feature set 120A having one or more first extracted features121A-131A from a first data set 115 of a first data source 111; a secondfeature set 120B having one or more second extracted features 121B-131Bfrom a second data set 116 of a second data source 112; and a thirdfeature set 120C having one or more third extracted features 121C-131Cfrom a third data set 117 of a third data source 113. It should beunderstood, that the extraction engine 322 may create any number offeature sets 120 having one or more extracted features 121-131 from adata set 115 of a any number of data sources.

In the examples shown in FIGS. 5-7 , the feature sets 120A, 120B, 120C,for an individual may comprise one or more extracted features, such as aname feature 121, a current employer feature 122, a first previousemployer feature 123, a second previous employer feature 124, a currentjob title feature 125, a first previous job title feature 126, a secondprevious job title feature 127, a contact address feature 128, a contactphone number feature 129, an education feature 130, and a certificationsfeature 131. Each feature may contain data for an individual thatgenerally is described by the name of the feature. For example and foran individual with the name of John Smith, data in the name feature121A, 121B, 121C, may comprise “John Smith”, data in the currentemployer feature 122 may comprise “ABC Company”, and data in the currentjob title feature 125 may comprise “project management leader”. However,it should be understood that a feature set 120 is not limited to thisinformation and other feature sets 120 may include any number offeatures.

FIG. 9 depicts a block diagram showing an example of an aggregatedprofile record 150 which may contain a number of data fields or features151-161. An aggregated profile record 150 may generally be likened to aprofile for an individual, created and stored by the system 100, witheach feature or data field preferably containing data which may describethat individual. All the talent data of an individual, having one ormore features that were extracted from a data set 115, 116, 117, of adata source 111, 112, 113, by the aggregation module 321 may be storedin an aggregated profile record 150 for that individual in the systemdatabase 330 by an aggregation engine 323. Preferably, an aggregationengine 323 may create an aggregated profile record 150 for eachindividual that the extraction engine 322 generates a feature set 120for.

In the example shown in FIG. 9 , the aggregated profile record 150 foran individual may comprise one or more aggregated features, such as aname feature 151, a current employer feature 152, a first previousemployer feature 153, a second previous employer feature 154, a currentjob title feature 155, a first previous job title feature 156, a secondprevious job title feature 157, a contact address feature 158, a contactphone number feature 159, an education feature 160, and a certificationsfeature 161. Each feature may contain data for an individual thatgenerally is described by the name of the feature. For example and foran individual with the name of John Smith, data in the name feature 151may comprise “John Smith”, data in the current employer feature 152 maycomprise “ABC Company”, and data in the current job title feature 155may comprise “project management leader”. However, it should beunderstood that an aggregated profile record 150 is not limited to thisinformation and other aggregated profile records 150 may include anynumber of aggregated features.

FIG. 10 illustrates a block diagram of an example of a computerimplemented method for disparate data source aggregation (“the method”)500 according to various embodiments described herein. The method 500may be performed by one or more engines 322, 323, 324, 325, of anaggregation module 321 to pull or retrieve talent data from disparatedata sources 111, 112, 113, automatically correlate the data across thedifferent data sources 111, 112, 113, build a self-adjusting systemdatabase 330 that captures the talent data from the disparate datasources 111, 112, 113, and lets users 101 search, query and build modelinsights on the aggregated data of the system database 330 without humanintervention.

In some embodiments, the method 500 may start 510 and the extractionengine 322 may extract feature data from one or more data sets 115, 116,117 of one or more data sources 111, 112, 113 in step 520 into one ormore feature sets 120A, 120B, 120C. The feature data or talent datacoming from data sources 111, 112, 113, could be structured feed orunstructured feeds. Structured feeds typically are CSV, Excel, Text,XML, JSON, etc., and unstructured feeds typically are a resume,document, image, html, blogs, web page, etc. For unstructured data, theextraction engine 322 may run natural language processing to extractfeatures in a document. Example features may include: name, location,skills, certifications, work experience, education, certifications andlanguages.

In step 530, the aggregation engine 323 may compare extracted features121-131 against disparate data sets (such as the extracted features121-131 of one or more other feature sets 120A, 120B, 120C) as discussedfurther in FIG. 11 . In some embodiments, step 530 may be accomplishedby comparing a set of extracted features 121-131 of a first feature set120A from a first data set 115 of a first data source 111 to the set ofextracted features 121-131 of a second feature set 120B from a seconddata set 116 of a second data source 112 and to the set of extractedfeatures 121-131 of a third feature set 120C from a third data set 117of a third data source 113 (or any number of data sources 111, 112,113).

In step 540, the aggregation engine 323 may aggregate the extractedfeatures 121-131 of one, but preferably two or more, feature sets 120A,120B, 120C, of an individual into an aggregated profile record 150 forthat individual. In some embodiments, the aggregation engine 323 maycreate an aggregated profile record 150 for an individual by aggregatingthe matching extracted feature sets 120A, 120B, 120C, (feature sets120A, 120B, 120C, having one or more extracted features 121-131 thatmatch one or more extracted features 121-131 of one or more otherfeature sets 120A, 120B, 120C) of that individual obtained in step 503from one or more data sources 111, 112, 113. In some embodiments, theaggregation engine 323 may generate or create an aggregated profilerecord 150 for each individual so that each individual may have at leastone aggregated profile record 150 and each aggregated profile record 150may comprise the data aggregated from the one or more data sources 111,112, 113, from step 520.

In step 545, the database engine 324 may store the aggregated profilerecord 150 into the system database 330. In some embodiments, theaggregated feature data (aggregated profile record 150 and/or aggregatedfeatures 151-161) for each individual may be saved in the systemdatabase 330 by the database engine 324 so that each individual may haveone aggregated profile record 150 and that aggregated profile record 150may comprise the data aggregated from the one or more data sources 111,112, 113, from step 520.

In step 550, the aggregation engine 323 may automatically self-adjustthe aggregated system database 330. In some embodiments, the aggregationengine 323 may automatically self-adjust the aggregated system database330 by periodically receiving extracted feature sets 120 for anindividual from the extraction engine 322. The aggregation engine 323may compare the features 121-131 of the feature sets 120 to the features151-161 of the aggregated profile record 150, preferably retrieved fromthe system database 330 by the database engine 324, and then aggregatethe talent data of the features 121-131 into the features 151-161 of theaggregated profile record 150 for the individual as discussed further inFIG. 12 . In this manner, the system 100 may self-adjust or self-updatethe aggregated profile records 150 of the system using data extractedfrom the data sets 115, 116, 117, of data sources 111, 112, 113.

In optional step 560, a database engine 324 may provide aggregated datafrom the system database 330 to one or more third party systems 107. Forexample, the database engine 324 may periodically provide aggregateddata from the system database 330 to a third party system 107, such asone or more websites, talent databases, human resource departments, andthe like, which may be enrolled or subscribed to the system 100. In thismanner, the one or more websites, talent databases, human resourcedepartments, and the like, which may be enrolled or subscribed to thesystem 100 may have access to continually updated talent data forindividuals having an aggregated profile record 150 in the systemdatabase 330.

In step 570, an application program interface (API) engine 325 mayprovide a self-adjusting application program interface (API). Inpreferred embodiments, the application program interface (API) engine325 may provide a self-adjusting API that may be query-able via webservices, Structured Query Language (SQL) queries, and 3rd partyvisualization tools that can support SQL, non SQL or non relational(NOSQL) dialect, or any other programmable interface as discussedfurther in FIG. 13 . The self-adjusting API may enable any number ofconsumers to search, query, and model insights into the data model ofthe system database 330 preferably via their respective client device400. After step 570, the method 500 may finish 580.

FIG. 11 depicts a block diagram of an example of a method for creatingand aggregating profile records (“the method”) 530 according to variousembodiments described herein. The method 530 may be used to extracttalent data or feature sets from one or more data sources 111, 112, 113,and to aggregate the data into the system database 330 as a singlefeature set (aggregated profile record 150) preferably for eachindividual having one or more profiles in the one or more data sources111, 112, 113. The method 530 may enable the system 100 to automaticallycorrelate and augment disparate talent datasets or feature sets forindividuals without the need for a unique identifier for eachindividual.

In some embodiments, the method 530 may start 531 and the extractionengine 322 may generate one or more extracted feature sets 120 from oneor more, and preferably each, data set 115, 116, 117, of one or moredata sources 111, 112, 113, in step 532.

In decision block 533 the aggregation engine 323 may determine if one ormore feature sets 120A, 120B, 120C, and/or their extracted features121-131 substantially match one or more other feature sets 120A, 120B,120C, and/or their extracted features 121-131 from disparate data sets115, 116, 117, of disparate data sources 111, 112, 113, withoutrequiring the feature sets 120A, 120B, 120C, to have a unique identifieror to be matched via matching unique identifiers (such as an emailaddress, social security number, etc.). In preferred embodiments, theaggregation engine 323 may determine if one or more feature sets 120A,120B, 120C, and/or their extracted features 121-131 substantially matchone or more other feature sets 120A, 120B, 120C, and/or their extractedfeatures 121-131 without requiring a unique identifier so that theaggregation engine 323 may match the feature sets 120A, 120B, 120C,without requiring an extracted feature 121-131 to be or function as aunique identifier. This is a novel feature to the system 100 and methodsof the present invention since current systems and methods are unable tomatch one or more profiles of an individual if the one or more profilesdo not have a unique identifier, such as an email address or socialsecurity number, that can be matched between the profiles.

In some embodiments, once the feature sets are extracted across datasources 111, 112, 113, the aggregation engine 323 may compare thefeature sets 120A, 120B, 120C, from the data sources 111, 112, 113, toaccurately identify the profile of an individual. For example, theaggregation engine 323 may pick feature set 120A, 120B, 120C, from afirst data source 111 and compare its features against N number of otherfeature sets 120A, 120B, 120C, from a second data source 112. Featuresets 120A, 120B, 120C, having a desired number of matching extractedfeatures 121-131 may be determined by the aggregation engine 323 todescribe the same individual, thereby eliminating the need for a uniqueidentifier to be used to match the feature sets 120A, 120B, 120C, fromthe disparate data sources 111, 112, 113. In other embodiments, theaggregation engine 323 may compare machine learning models to accuratelyidentify profiles of an individual from two or more data sources 111,112, 113. In cases where a profile from a data source 111, 112, 113,matches multiple profiles in another data source 111, 112, 113, theaggregation engine 323 may perform disambiguation checks so inaccurateprofiles are not correlated.

If the aggregation engine 323 determines one or more feature sets 120A,120B, 120C, and/or their extracted features 121-131 substantially matchone or more other feature sets 120A, 120B, 120C, and/or their extractedfeatures 121-131, the method 530 may proceed to step 534. If theaggregation engine 323 determines one or more feature sets 120A, 120B,120C, and/or their extracted features 121-131 do not match one or moreother feature sets 120A, 120B, 120C, and/or their extracted features121-131, the method 530 may proceed to step 532.

In step 534 the aggregation engine 323 may aggregate the substantiallymatching feature sets 120 for an individual into one or more aggregatedprofile records 150 and/or aggregated features 151-161. In preferredembodiments, the aggregation engine 323 may aggregate the substantiallymatching feature sets 120 for an individual into an aggregated profilerecord 150. The aggregation engine 323 may aggregate the substantiallymatching feature sets 120 for an individual by combining data in theextracted features 121-131 from the matching feature sets 120A, 120B,120C, into the aggregated features 151-161 of an aggregated profilerecord 150. For example, a current employer feature 152 may comprisedata from one or more current employer features 122A, 122C.

In step 534 the database engine 324 may store the aggregated profilerecord 150 and/or aggregated features 151-161 in the system database330. Preferably, the aggregation engine 323 and/or the database engine324 may create an aggregated profile record 150 if the individual doesnot already have an aggregated profile record 150. If the individualalready has an aggregated profile record 150 in the system database 330,the aggregation engine 323 and/or the database engine 324 may update orotherwise combine the aggregated profile record 150 and/or aggregatedfeatures 151-161 from step 533 into the existing aggregated profilerecord 150 and/or aggregated features 151-161. After step 535, themethod 530 may finish 536.

FIG. 12 shows a block diagram of an example of a method for managing aself-adjusting aggregated system database (“the method”) 550 accordingto various embodiments described herein. The method 550 may be used toprovide a self-adjusting data model as the system database 330 thatshows comprehensive data about an individual extracted from disparatedata sources. Preferably, the aggregation module 321 may automaticallyself-adjust the aggregated system database 330 by periodicallyextracting, comparing, and aggregating talent data from one or more datasources 111, 112, 113.

In some embodiments, the method 550 may start 551 and the aggregationengine 323 and/or database engine 324 may have created an aggregatedprofile record 150 for one or more individuals in the system database330 in step 552 according to method 530 shown in FIG. 11 .

Next, the aggregation engine 323 and/or database engine 324 mayautomatically self-adjust the aggregated system database 330 byadjusting, updating, or otherwise modifying the aggregated features151-161 of an individual in their aggregated profile record 150 usingdata from feature sets 120 extracted from one or more data sources 111,112, 113, in step 553. In preferred embodiments, the extraction engine322 may periodically extract feature sets 120 from one or more datasources 111, 112, 113, such as every day, twice a week, three times amonth, monthly, quarterly, or any other time period. The periodicallyobtained feature sets 120 may then be provided to the aggregation engine323 and/or database engine 324 for automatically self-adjusting theaggregated system database 330. Each feature set 120 may comprise one ormore extracted features 121-131 for an individual. The extraction engine322 may extract the feature sets 120 from data set(s) 115, 116, 117, ofdata sources 111, 112, 113, such as from public data for John Smith(candidate in Indeed), internal data for John Smith (employee in HRIS),and internal data for John Smith (potential prospect in CRM), so thatthe aggregation engine 323 may access the individual feature sets andthe feature data contained therein for each individual. As theaggregation engine 323 aggregates profiles or feature sets from acrossdifferent data sources 111, 112, 113, optionally the aggregation engine323 merges the features from the matching profiles or feature sets ofthe data sources 111, 112, 113, into the single feature set oraggregated profile for each individual. The aggregation engine 323and/or database engine 324 may update the aggregated profile records 150of system database 300 with extracted features 121-131 fromsubstantially matching feature sets 120 from the one or more datasources 111, 112, 113. In some embodiments, the aggregation engine 323and/or database engine 324 may update the aggregated profile record 150in the system database 300 of an individual with extracted features121-131 from substantially matching feature sets 120 for that individualthat were obtained from the one or more data sources 111, 112, 113. Forexample, an individual may have a profile having a feature set 120A withdata describing the individual as a candidate in a first data source 111(e.g. applicant tracking system or ATS), the individual may have aprofile having feature set 120B data describing the individual as hiredand being an employee in a second data source 112 (e.g. human resourceinformation system or HRIS), and the individual may have a profilehaving feature set 120C data describing the individual as being apotential prospect in a third data source 113 (e.g. CandidateRelationship Management or CRM). In this example, the aggregation engine323 automatically correlates them and creates a feature set 120 that isself-adjusting via the periodic aggregation of the feature sets from thedifferent or disparate data sources 111, 112, 113. If the individual isalso found in another data source 111, 112, 113, the aggregation engine323 and/or database engine 324 will automatically update the systemdatabase 330 so that it is built with the features for that individualfrom the new data source 111, 112, 113. In some embodiments, the data ofthe system database 330 could be organized as relational database,JavaScript Object Notation (JSON), XML, Text, CSV, Excel, or any otherformat.

Next, in step 554 the database engine 324 may provide aggregated datafrom the system database 330 to third party systems 107. In someembodiments, the database engine 324 may provide aggregated data fromthe system database 330 to third party systems 107 which may besubscribed to the system 100 or otherwise allowed to access data of thesystem database 330. In further embodiments, the database engine 324 mayprovide aggregated data from the system database 330 to third partysystems 107 which may request access data of the system database 330.

In step 555, the API engine 325 may provide a self adjusting applicationprogram interface which may enable data of the system database 330 to beaccessed by users 101 to perform a variety of functions with the data inthe system database 330, such as querying and modeling insights forprofiles or aggregated profile records 150 in the system database 330across range of features from the data extracted from disparate datasources 111, 112, 113, that has been aggregated into the system database330 as described in FIG. 13 . After step 555, the method 550 may finish556.

FIG. 13 illustrates a block diagram of an example of a method ofproviding a self-adjusting application program interface (“the method”)570 according to various embodiments described herein. The method 570may be used to provide a self-adjusting application program interface(API) that lets users 101 perform a variety of functions with the datain the system database 330, such as querying and modeling insights forprofiles or aggregated profile records 150 in the system database 330across range of features from the data extracted from disparate datasources 111, 112, 113, that has been aggregated into the system database330.

In some embodiments, the method 570 may start 571 and the API engine 325may receive a query or other data manipulation instruction from aprogrammable interface in steps 572, 573, and/or 574. In preferredembodiments, the self-adjusting API provided by the API engine 325 maybe query-able via a web service, a Structured Query Language (SQL)query, a 3rd party visualization tool that can support SQL, non SQL ornon relational (NOSQL) dialect, or any other programmable interface. TheAPI engine 325 may function as a self-adjusting API by enabling anynumber of consumers or users 101 to search, query, and model insightsinto the data model of the system database 330 through a firstprogrammable interface 572, a second programmable interface 573, a thirdprogrammable interface 574, and/or any other number or types ofprogrammable interfaces preferably via the client device 400 of therespective consumers or users 101.

In step 575, the aggregation module 321, via an API engine 325 and/ordatabase engine 324 may search one or more databases and data sources111, 112, 113, such as the system database 330, public database systems,Human Resources Information Systems (HRIS), and Candidate RelationshipManagement (CRM) systems, for data that satisfies respective query of aconsumer or user 101. In preferred embodiments, the aggregation module321 may search across a plurality of databases and data sources 111,112, 113, for features to populate an aggregated profile 150 frommultiple databases and multiple data sources (providing a Self-adjustingAPI) for data that satisfies respective query of a consumer or user 101.The API engine 325 may function as a self-adjusting API by enabling anynumber of consumers or users 101 to search, query, and model insightsinto the data model of the system database 330 which the API engine 325self-adjusted to aggregate talent data or feature data from one or moredisparate data sources 111, 112, 113.

In step 575, the API engine 325 may output data to the consumers orusers 101 via the respective programmable interface they originally usedthrough their respective client device 400 that satisfies theirrespective query(s). After step 575, the method 570 may finish 576.

It will be appreciated that some exemplary embodiments described hereinmay include one or more generic or specialized processors (or“processing devices”) such as microprocessors, digital signalprocessors, customized processors and field programmable gate arrays(FPGAs) and unique stored program instructions (including both softwareand firmware) that control the one or more processors to implement, inconjunction with certain non-processor circuits, some, most, or all ofthe functions of the methods and/or systems described herein.Alternatively, some or all functions may be implemented by a statemachine that has no stored program instructions, or in one or moreapplication specific integrated circuits (ASICs), in which each functionor some combinations of certain of the functions are implemented ascustom logic. Of course, a combination of the two approaches may beused. Moreover, some exemplary embodiments may be implemented as acomputer-readable storage medium having computer readable code storedthereon for programming a computer, server, appliance, device, etc. eachof which may include a processor to perform methods as described andclaimed herein. Examples of such computer-readable storage mediumsinclude, but are not limited to, a hard disk, an optical storage device,a magnetic storage device, a ROM (Read Only Memory), a PROM(Programmable Read Only Memory), an EPROM (Erasable Programmable ReadOnly Memory), an EEPROM (Electrically Erasable Programmable Read OnlyMemory), a Flash memory, and the like.

Embodiments of the subject matter and the functional operationsdescribed in this specification can be implemented in digital electroniccircuitry, or in computer software, firmware, or hardware, including thestructures disclosed in this specification and their structuralequivalents, or in combinations of one or more of them. Embodiments ofthe subject matter described in this specification can be implemented asone or more computer program products, i.e., one or more modules ofcomputer program instructions encoded on a tangible program carrier forexecution by, or to control the operation of, data processing apparatus.The tangible program carrier can be a propagated signal or a computerreadable medium. The propagated signal is an artificially generatedsignal, e.g., a machine generated electrical, optical, orelectromagnetic signal that is generated to encode information fortransmission to suitable receiver apparatus for execution by a computer.The computer readable medium can be a machine readable storage device, amachine readable storage substrate, a memory device, a composition ofmatter effecting a machine readable propagated signal, or a combinationof one or more of them.

A computer program (also known as a program, software, softwareapplication, application, script, or code) can be written in any form ofprogramming language, including compiled or interpreted languages, ordeclarative or procedural languages, and it can be deployed in any form,including as a standalone program or as a module, component, subroutine,or other unit suitable for use in a computing environment. A computerprogram does not necessarily correspond to a file in a file system. Aprogram can be stored in a portion of a file that holds other programsor data (e.g., one or more scripts stored in a markup languagedocument), in a single file dedicated to the program in question, or inmultiple coordinated files (e.g., files that store one or more modules,sub programs, or portions of code). A computer program can be deployedto be executed on one computer or on multiple computers that are locatedat one site or distributed across multiple sites and interconnected by acommunication network.

Additionally, the logic flows and structure block diagrams described inthis patent document, which describe particular methods and/orcorresponding acts in support of steps and corresponding functions insupport of disclosed structural means, may also be utilized to implementcorresponding software structures and algorithms, and equivalentsthereof. The processes and logic flows described in this specificationcan be performed by one or more programmable processors executing one ormore computer programs to perform functions by operating on input dataand generating output.

Processors suitable for the execution of a computer program include, byway of example, both general and special purpose microprocessors, andany one or more processors of any kind of digital computer. Generally, aprocessor will receive instructions and data from a read only memory ora random access memory or both. The essential elements of a computer area processor for performing instructions and one or more memory devicesfor storing instructions and data. Generally, a computer will alsoinclude, or be operatively coupled to receive data from or transfer datato, or both, one or more mass storage devices for storing data, e.g.,magnetic, magneto optical disks, solid state drives, or optical disks.However, a computer need not have such devices.

Computer readable media suitable for storing computer programinstructions and data include all forms of non volatile memory, mediaand memory devices, including by way of example semiconductor memorydevices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks,e.g., internal hard disks or removable disks; magneto optical disks; andCD ROM and DVD ROM disks. The processor and the memory can besupplemented by, or incorporated in, special purpose logic circuitry.

To provide for interaction with a user, embodiments of the subjectmatter described in this specification can be implemented on a computerhaving a display device, e.g., a CRT (cathode ray tube) or LCD (liquidcrystal display) monitor, for displaying information to the user and akeyboard and a pointing device, e.g., a mouse or a trackball, by whichthe user can provide input to the computer. Other kinds of devices canbe used to provide for interaction with a user as well; for example,feedback provided to the user can be any form of sensory feedback, e.g.,visual feedback, auditory feedback, or tactile feedback; and input fromthe user can be received in any form, including acoustic, speech, ortactile input.

Embodiments of the subject matter described in this specification can beimplemented in a computing system that includes a back end component,e.g., as a data server, or that includes a middleware component, e.g.,an application server, or that includes a front end component, e.g., aclient computer having a graphical user interface or a Web browserthrough which a user can interact with an implementation of the subjectmatter described is this specification, or any combination of one ormore such back end, middleware, or front end components. The componentsof the system can be interconnected by any form or medium of digitaldata communication, e.g., a communication network. Examples ofcommunication networks include a local area network (“LAN”) and a widearea network (“WAN”), e.g., the Internet.

The computing system can include clients and servers. A client andserver are generally remote from each other and typically interactthrough a communication network or the cloud. The relationship of clientand server arises by virtue of computer programs running on therespective computers and having a client server relationship to eachother.

Further, many embodiments are described in terms of sequences of actionsto be performed by, for example, elements of a computing device. It willbe recognized that various actions described herein can be performed byspecific circuits (e.g., application specific integrated circuits(ASICs)), by program instructions being executed by one or moreprocessors, or by a combination of both. Additionally, these sequence ofactions described herein can be considered to be embodied entirelywithin any form of computer readable storage medium having storedtherein a corresponding set of computer instructions that upon executionwould cause an associated processor to perform the functionalitydescribed herein. Thus, the various aspects of the invention may beembodied in a number of different forms, all of which have beencontemplated to be within the scope of the claimed subject matter. Inaddition, for each of the embodiments described herein, thecorresponding form of any such embodiments may be described herein as,for example, “logic configured to” perform the described action.

The computer system may also include a main memory, such as a randomaccess memory (RAM) or other dynamic storage device (e.g., dynamic RAM(DRAM), static RAM (SRAM), and synchronous DRAM (SDRAM)), coupled to thebus for storing information and instructions to be executed byprocessor. In addition, the main memory may be used for storingtemporary variables or other intermediate information during theexecution of instructions by the processor. The computer system mayfurther include a read only memory (ROM) or other static storage device(e.g., programmable ROM (PROM), erasable PROM (EPROM), and electricallyerasable PROM (EEPROM)) coupled to the bus for storing staticinformation and instructions for the processor.

The computer system may also include a disk controller coupled to thebus to control one or more storage devices for storing information andinstructions, such as a magnetic hard disk, and a removable media drive(e.g., floppy disk drive, read-only compact disc drive, read/writecompact disc drive, compact disc jukebox, tape drive, and removablemagneto-optical drive). The storage devices may be added to the computersystem using an appropriate device interface (e.g., small computersystem interface (SCSI), integrated device electronics (IDE),enhanced-IDE (E-IDE), direct memory access (DMA), or ultra-DMA).

The computer system may also include special purpose logic devices(e.g., application specific integrated circuits (ASICs)) or configurablelogic devices (e.g., simple programmable logic devices (SPLDs), complexprogrammable logic devices (CPLDs), and field programmable gate arrays(FPGAs)).

The computer system may also include a display controller coupled to thebus to control a display, such as a cathode ray tube (CRT), liquidcrystal display (LCD) or any other type of display, for displayinginformation to a computer user. The computer system may also includeinput devices, such as a keyboard and a pointing device, for interactingwith a computer user and providing information to the processor.Additionally, a touch screen could be employed in conjunction withdisplay. The pointing device, for example, may be a mouse, a trackball,or a pointing stick for communicating direction information and commandselections to the processor and for controlling cursor movement on thedisplay. In addition, a printer may provide printed listings of datastored and/or generated by the computer system.

The computer system performs a portion or all of the processing steps ofthe invention in response to the processor executing one or moresequences of one or more instructions contained in a memory, such as themain memory. Such instructions may be read into the main memory fromanother computer readable medium, such as a hard disk or a removablemedia drive. One or more processors in a multi-processing arrangementmay also be employed to execute the sequences of instructions containedin main memory. In alternative embodiments, hard-wired circuitry may beused in place of or in combination with software instructions. Thus,embodiments are not limited to any specific combination of hardwarecircuitry and software.

As stated above, the computer system includes at least one computerreadable medium or memory for holding instructions programmed accordingto the teachings of the invention and for containing data structures,tables, records, or other data described herein. Examples of computerreadable media are compact discs, hard disks, floppy disks, tape,magneto-optical disks, PROMs (EPROM, EEPROM, flash EPROM), DRAM, SRAM,SDRAM, or any other magnetic medium, compact discs (e.g., CD-ROM), orany other optical medium, punch cards, paper tape, or other physicalmedium with patterns of holes, a carrier wave (described below), or anyother medium from which a computer can read.

Stored on any one or on a combination of computer readable media, thepresent invention includes software for controlling the computer system,for driving a device or devices for implementing the invention, and forenabling the computer system to interact with a human user. Suchsoftware may include, but is not limited to, device drivers, operatingsystems, development tools, and applications software. Such computerreadable media further includes the computer program product of thepresent invention for performing all or a portion (if processing isdistributed) of the processing performed in implementing the invention.

The computer code or software code of the present invention may be anyinterpretable or executable code mechanism, including but not limited toscripts, interpretable programs, dynamic link libraries (DLLs), Javaclasses, and complete executable programs. Moreover, parts of theprocessing of the present invention may be distributed for betterperformance, reliability, and/or cost.

Various forms of computer readable media may be involved in carrying outone or more sequences of one or more instructions to processor forexecution. For example, the instructions may initially be carried on amagnetic disk of a remote computer. The remote computer can load theinstructions for implementing all or a portion of the present inventionremotely into a dynamic memory and send the instructions over the air(e.g. through a wireless cellular network or wifi network). A modemlocal to the computer system may receive the data over the air and usean infrared transmitter to convert the data to an infrared signal. Aninfrared detector coupled to the bus can receive the data carried in theinfrared signal and place the data on the bus. The bus carries the datato the main memory, from which the processor retrieves and executes theinstructions. The instructions received by the main memory mayoptionally be stored on storage device either before or after executionby processor.

The computer system also includes a communication interface coupled tothe bus. The communication interface provides a two-way datacommunication coupling to a network link that is connected to, forexample, a local area network (LAN), or to another communicationsnetwork such as the Internet. For example, the communication interfacemay be a network interface card to attach to any packet switched LAN. Asanother example, the communication interface may be an asymmetricaldigital subscriber line (ADSL) card, an integrated services digitalnetwork (ISDN) card or a modem to provide a data communicationconnection to a corresponding type of communications line. Wirelesslinks may also be implemented. In any such implementation, thecommunication interface sends and receives electrical, electromagneticor optical signals that carry digital data streams representing varioustypes of information.

The network link typically provides data communication to the cloudthrough one or more networks to other data devices. For example, thenetwork link may provide a connection to another computer or remotelylocated presentation device through a local network (e.g., a LAN) orthrough equipment operated by a service provider, which providescommunication services through a communications network. In preferredembodiments, the local network and the communications network preferablyuse electrical, electromagnetic, or optical signals that carry digitaldata streams. The signals through the various networks and the signalson the network link and through the communication interface, which carrythe digital data to and from the computer system, are exemplary forms ofcarrier waves transporting the information. The computer system cantransmit and receive data, including program code, through thenetwork(s) and, the network link and the communication interface.Moreover, the network link may provide a connection through a LAN to aclient device such as a personal digital assistant (PDA), laptopcomputer, or cellular telephone. The LAN communications network and theother communications networks such as cellular wireless and wifinetworks may use electrical, electromagnetic or optical signals thatcarry digital data streams. The processor system can transmitnotifications and receive data, including program code, through thenetwork(s), the network link and the communication interface.

Although the present invention has been illustrated and described hereinwith reference to preferred embodiments and specific examples thereof,it will be readily apparent to those of ordinary skill in the art thatother embodiments and examples may perform similar functions and/orachieve like results. All such equivalent embodiments and examples arewithin the spirit and scope of the present invention, are contemplatedthereby, and are intended to be covered by the following claims.

What is claimed is:
 1. A computer implemented method for disparate datasource aggregation, the method comprising the steps of: extracting, viaa computing device processor, a first feature set having a plurality offirst extracted features from a first data set of a first data sourceand extracting via a computing device processor, a second feature sethaving a plurality of second extracted features from a second data setof a second data source and wherein both the first data set and thesecond data set do not have a common unique identifier; determining, viaa computing device processor, whether the first feature set matches thesecond feature set, wherein the determining whether the first featureset matches the second feature set comprises comparing the plurality offirst extracted features to the plurality of second extracted featuresto determine a number of matching extracted features between the firstfeature set and the second feature set; in response to determining thatthe number of matching extracted features between the first feature setand the second feature set meets or exceeds a matching threshold number,aggregating, via a computing device processor, the first feature setwith the second feature set; and storing, via a computing deviceprocessor, the aggregated first feature set and aggregated secondfeature set as an aggregated profile record in a system database.
 2. Thecomputer implemented method of claim 1, further comprising automaticallyself-adjusting the aggregated profile record of the system database. 3.The computer implemented method of claim 1, wherein the first extractedfeature and the second extracted feature are extracted via naturallanguage processing.
 4. The computer implemented method of claim 1,wherein the first extracted feature is periodically extracted from thefirst data set, and wherein the second extracted feature is periodicallyextracted from the second data set.
 5. The computer implemented methodof claim 1, further providing, via computing device processor,aggregated data from the system database to a third party system.
 6. Thecomputer implemented method of claim 1, further comprising providing,via computing device processor, a self-adjusting application programinterface.
 7. The computer implemented method of claim 1, furthercomprising a self-adjusting application program interface operation, theoperation comprising: receiving, via a computing device processor, afirst query from a first programmable interface; searching, via acomputing device processor, a system database for data that satisfiesthe first query; and outputting data, via a computing device processorand the first programmable interface, data that satisfies the firstquery.
 8. The computer implemented method of claim 7, wherein the firstquery comprises a request for a feature from an aggregated profilerecord.
 9. The computer implemented method of claim 7, wherein the firstprogrammable interface comprises a programmable interface selected fromthe group consisting of a web service, a Structured Query Languagequery, a 3rd party visualization tool that can support Structured QueryLanguage, a 3rd party visualization tool that can support non relationalStructured Query Language.
 10. The computer implemented method of claim7, further comprising storing, via a computing device processor, theaggregated first feature set and second feature set in an aggregatedprofile record in a system database.
 11. The computer implemented methodof claim 7, further comprising automatically self-adjusting theaggregated profile record of the system database.
 12. The computerimplemented method of claim 7, wherein the first extracted feature andthe second extracted feature are extracted via natural languageprocessing.
 13. The computer implemented method of claim 7, wherein thefirst extracted feature is periodically extracted from the first dataset, and wherein the second extracted feature is periodically extractedfrom the first data set.
 14. The computer implemented method of claim 7,further comprising providing, via computing device processor, aggregateddata from the system database to a third party system.
 15. The computerimplemented method of claim 7, further comprising providing, viacomputing device processor, a self-adjusting application programinterface.
 16. The computer implemented method of claim 1, wherein thefirst data set comprises structured data and the second data setcomprises unstructured data.
 17. The computer implemented method ofclaim 16, wherein the unstructured data is a resume document file. 18.The computer implemented method of claim 16, wherein the unstructureddata is an image file.
 19. The computer implemented method of claim 16,wherein the unstructured data is a web page.
 20. A non-transitory,computer-readable medium coupled to one or more processors and havinginstructions stored thereon which, when executed by the one or moreprocessors, cause the one or more processors to perform operations, theoperations comprising: extracting, via a computing device processor, afirst feature set having a plurality of first extracted features from afirst data set of a first data source and extracting via a computingdevice processor, a second feature set having a plurality of secondextracted features from a second data set of a second data source andwherein both the first data set and the second data set do not have acommon unique identifier; determining, via a computing device processor,whether the first feature set matches the second feature set, whereinthe determining whether the first feature set matches the second featureset comprises comparing the plurality of first extracted features to theplurality of second extracted features to determine a number of matchingextracted features between the first feature set and the second featureset; in response to determining that the number of matching extractedfeatures between the first feature set and the second feature set meetsor exceeds a matching threshold number, aggregating, via a computingdevice processor, the first feature set with the second feature set; andstoring, via a computing device processor, the aggregated first featureset and aggregated second feature set as an aggregated profile record ina system database.